Eurowordnet: a Multilingual Database of Autonomous and Language-specific Wordnets Connected via an Inter - Lingual-index

نویسنده

  • Piek Vossen
چکیده

This paper describes the multilingual design of the EuroWordNet database. The EuroWordNet database stores wordnets as autonomous language-specific structures that are interconnected via an Inter-Lingual-Index (ILI). In this paper, we discuss the possibilities to create mappings from each wordnet to the central ILI and how the ILI itself can be adapted to provide more overlap across the wordnets. We will argue that the ILI can be condensed to a more universal index of meaning, while the wordnets can still encode any fine-grained lexicalizations for each language.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

The Automatic Mapping of Princeton WordNet Lexical-Conceptual Relations onto the Brazilian Portuguese WordNet Database

Princeton WordNet (WN.Pr) lexical database has motivated efficient compilations of bulky relational lexicons since its inception in the 1980 ́s. The EuroWordNet project, the first multilingual initiative built upon WN.Pr, opened up ways of building individual wordnets, and interrelating them by means of the so-called Inter-Lingual-Index, an unstructured list of the WN.Pr synsets. Other important...

متن کامل

Multilingual Design Of EuroWordNet

This paper discusses the design of the EuroWordNet database, in which semantic databases like WordNetl.5 for several languages are combined via a so-called inter-lingual-index. In this database, language-independent data is shared and language-specific properties are maintained as well. A special interface has been developed to compare the semantic configurations across languages and to track d...

متن کامل

EuroWordNet: a multilingual database for information retrieval

The aim of the EuroWordNet-project is the development of a database with wordnets for English, Spanish, Dutch and Italian, similar to the Princeton WordNet1.5, which contains basic semantic relations between words in English. The Dutch, Italian and Spanish wordnets will be linked to the WordNet1.5 using equivalence relations. The resulting multilingual database can directly be used in (multi-li...

متن کامل

Categories and Classifications in Eurowordnet

In EuroWordNet we develop wordnets in 8 European languages, which are structured along the same lines as the Princeton WordNet. The wordnets are inter-linked in a multilingual database, where they can be compared. This comparison reveals many different lexicalizations of classes across the languages that also lead to important differences in the hierarchical structure of the wordnets. It is not...

متن کامل

MCR for CLIR

The Multilingual Central Repository (MCR) is based on the design of the EuroWordNet database. The MCR holds WordNets in various languages (English, Spanish, Italian, Catalan and Basque), which are interconnected via an Inter-LingualIndex (ILI). In addition, the MCR holds a number of ontologies and domain labels related to all concepts. This paper describes the integration and evaluation of the ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2004